CDS

Accession Number TCMCG068C41388
gbkey CDS
Protein Id KAG5609259.1
Location join(9452204..9452270,9454063..9454131,9454282..9454341,9454493..9454591,9464454..9464565,9464717..9464782,9464890..9464923,9465005..9465073,9465716..9465790)
Organism Solanum commersonii
locus_tag H5410_020540

Protein

Length 216aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA655804, BioSample:SAMN15755581
db_source JACXVP010000004.1
Definition hypothetical protein H5410_020540 [Solanum commersonii]
Locus_tag H5410_020540

EGGNOG-MAPPER Annotation

COG_category K
Description DNA-directed RNA polymerases IV and V subunit 4
KEGG_TC -
KEGG_Module M00180        [VIEW IN KEGG]
KEGG_Reaction R00435        [VIEW IN KEGG]
R00441        [VIEW IN KEGG]
R00442        [VIEW IN KEGG]
R00443        [VIEW IN KEGG]
KEGG_rclass RC02795        [VIEW IN KEGG]
BRITE br01611        [VIEW IN KEGG]
ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K03012        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00230        [VIEW IN KEGG]
ko00240        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko03020        [VIEW IN KEGG]
ko05016        [VIEW IN KEGG]
ko05169        [VIEW IN KEGG]
map00230        [VIEW IN KEGG]
map00240        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map03020        [VIEW IN KEGG]
map05016        [VIEW IN KEGG]
map05169        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCGGAGAAAGGAGGAAAAGGGTTTCCTTTGCCGAAAAGTGGAAAGTCTGCTCTGAAATCCCCTGCATCCAAAGGGAAGGATGATAGCTCAGCGAAGTCCAAAAGAGGAAGGAAAGTTCAGTTTGATTCTGAAGGATCGCTTGATACCAATTCCACAAAATCAAATGGAAAAGCTGATATATCATCTTTCAAAGGTGATTTGGGCAAAGCCGGGAAAGGAGAGAAAGCTGGCAGTGCTGGTAAAAGTCAAAAAGCAAAAGCACCTGATCCCTTGGAGCTGAGAGTTGAGCAAGAACTTCCAGCCAATACAACATGCCTGATGGATTGTGAAGCTGCTGATATTTTGCAAGGAATCCAAGAGAACATGGTGGTATTGTCTGATGATCCAGCTATAAAACTACCTGTGGGATTGGCATATGCTCAAAGGAACAGGCTTTATGATAATCCCCAGGCTGTTAAACAAATACTCGAGCCTCTAAAACAGCACGGCGTTTCTGATGGGGAGCTTTGCATGATTGCCAACTTTCCCTTGGAATCTGTTGATGAAGTGTTTGCTCTTGTTCCCTCATTTAAGAATAAAAAGAGTAAGCTGAGAGTTCCCCTCGAGAAAGTCTTGGCTGAACTGGCCAAACTTAGAAAGGCAGCATAA
Protein:  
MAEKGGKGFPLPKSGKSALKSPASKGKDDSSAKSKRGRKVQFDSEGSLDTNSTKSNGKADISSFKGDLGKAGKGEKAGSAGKSQKAKAPDPLELRVEQELPANTTCLMDCEAADILQGIQENMVVLSDDPAIKLPVGLAYAQRNRLYDNPQAVKQILEPLKQHGVSDGELCMIANFPLESVDEVFALVPSFKNKKSKLRVPLEKVLAELAKLRKAA